Transductive Learning via Model Selection; Can Overfitting be Exploited?
نویسندگان
چکیده
A novel transductive learning algorithm is proposed, which is based on the use of model selection. In its simplest form there are k possible labels, m labeled points and one unlabeled point. One model is built for each possible classification of the unlabeled point yM+1 = Li, i = 1, ..., k, using all m+1 points and m + 1 labels. Any standard model selection criterion can then be applied to select one of the k models. The algorithm simply chooses the label Li that produced that model. We define the algorithm, show statistical justifications for it, and experimentally show the effectiveness of the algorithm when using a simple linear model combined with a model selection based on cross-validation.
منابع مشابه
Effective Transductive Learning via PAC-Bayesian Model Selection
We study a transductive learning approach based on clustering. In this approach one constructs a diversity of unsupervised models of the unlabeled data using clustering algorithms. These models are then exploited to construct a number of hypotheses using the labeled data and the learner selects an hypothesis that minimizes a transductive PACBayesian error bound, which holds with high probabilit...
متن کاملEffective transductive learning via objective model selection
This paper is concerned with transductive learning. We study a recent transductive learning approach based on clustering. In this approach one constructs a diversity of unsupervised models of the unlabeled data using clustering algorithms. These models are then exploited to construct a number of hypotheses using the labeled data and the learner selects an hypothesis that minimizes a transductiv...
متن کاملAnalysis and Improved Recognition of Protein Names Using Transductive SVM
We first analyzed protein names using various dictionaries and databases and found five problems with protein names; i.e., the treatment of special characters, the treatment of homonyms, cases where the protein-name string may be a substring of a different protein-name string, cases where one protein exists in different organisms, and the treatment of modifiers. We confirmed that we could use a...
متن کاملTransductive Learning via Model Selection
A novel transductive learning algorithm is proposed, which is based on the use of model selection. In its simplest form there are k possible labels, m labeled points and one unlabeled point. One model is built for each possible classification of the unlabeled point yM+1 = Li, i = 1, ..., k, using all M + 1 points and M + 1 labels. Any standard model selection criterion can then be applied to se...
متن کاملTransductive Classification via Local Learning Regularization
The idea of local learning, classifying a particular point based on its neighbors, has been successfully applied to supervised learning problems. In this paper, we adapt it for Transductive Classification (TC) problems. Specifically, we formulate a Local Learning Regularizer (LL-Reg) which leads to a solution with the property that the label of each data point can be well predicted based on its...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005